pattern distribution
Equilibrium flow: From Snapshots to Dynamics
Scientific data, from cellular snapshots in biology to celestial distributions in cosmology, often consists of static patterns from underlying dynamical systems. These snapshots, while lacking temporal ordering, implicitly encode the processes that preserve them. This work investigates how strongly such a distribution constrains its underlying dynamics and how to recover them. We introduce the Equilibrium flow method, a framework that learns continuous dynamics that preserve a given pattern distribution. Our method successfully identifies plausible dynamics for 2-D systems and recovers the signature chaotic behavior of the Lorenz attractor. For high-dimensional Turing patterns from the Gray-Scott model, we develop an efficient, training-free variant that achieves high fidelity to the ground truth, validated both quantitatively and qualitatively. Our analysis reveals the solution space is constrained not only by the data but also by the learning model's inductive biases. This capability extends beyond recovering known systems, enabling a new paradigm of inverse design for Artificial Life. By specifying a target pattern distribution, we can discover the local interaction rules that preserve it, leading to the spontaneous emergence of complex behaviors, such as life-like flocking, attraction, and repulsion patterns, from simple, user-defined snapshots.
An Efficient Multi-Robot Arm Coordination Strategy for Pick-and-Place Tasks using Reinforcement Learning
Jermann, Tizian, Kolvenbach, Hendrik, Estay, Fidel Esquivel, Kramer, Koen, Hutter, Marco
LASTIC pollution in rivers has become a pressing global issue, with 11 million tons of plastic waste entering the ocean annually, 80% of which is caused by 1,000 major polluting rivers [1]. To address this problem, it is desired to develop a solution capable of removing plastic and other waste objects without interfering with the existing flora and fauna essential to river ecosystems [2] . Our Autonomous River Cleanup (ARC) project, initiated in 2019, leverages robotics and automation to remove plastic waste from rivers. In order to increase the capacity at which this can be done, we enhance the existing single arm sorting station [3] with additional robot arms. For multiple robot agents to efficiently sort waste on a conveyor belt, we develop and evaluate novel strategy algorithms using reinforcement learning that assign pick-and-place (PnP) tasks to the respective robot agents (Figure 1). Given a set of objects on the moving conveyor belt, the robot agents are tasked with removing waste objects, whilst bio-matter is ignored and collected at the end of the belt. The challenge is to allocate each robot optimally with PnP operations for objects within its reachable workspace.
Ordinal analysis of lexical patterns
Sanchez, David, Zunino, Luciano, De Gregorio, Juan, Toral, Raul, Mirasso, Claudio
Words are fundamental linguistic units that connect thoughts and things through meaning. However, words do not appear independently in a text sequence. The existence of syntactic rules induces correlations among neighboring words. Using an ordinal pattern approach, we present an analysis of lexical statistical connections for 11 major languages. We find that the diverse manners that languages utilize to express word relations give rise to unique pattern structural distributions. Furthermore, fluctuations of these pattern distributions for a given language can allow us to determine both the historical period when the text was written and its author. Taken together, our results emphasize the relevance of ordinal time series analysis in linguistic typology, historical linguistics and stylometry.
Minimax rate of consistency for linear models with missing values
Ayme, Alexis, Boyer, Claire, Dieuleveut, Aymeric, Scornet, Erwan
Missing values are more and more present as the size of datasets increases. These missing values can occur for a variety of reasons, such as sensor failures, refusals to answer poll questions, or aggregations of data coming from different sources (with different methods of data collection). There may be different processes of missing value generation on the same dataset, which makes the task of data cleaning difficult or impossible without creating large biases. In his leading work, Rubin [1976] distinguishes three missing values scenarios: Missing Completely At Random (MCAR), Missing At Random (MAR), and Missing Not At Random (MNAR), depending on the links between the observed variables, the missing ones, and the missing pattern. In the linear regression framework, most of the literature focuses on parameter estimation [Little, 1992, Jones, 1996], using sometimes a sparse prior leading to the Lasso estimator [Loh and Wainwright, 2012] or the Dantzig selector [Rosenbaum and Tsybakov, 2010]. Note that the robust estimation literature [Dalalyan and Thompson, 2019, Chen and Caramanis, 2013] could be also used to handle missing values, as the latter can be reinterpreted as a multiplicative noise in linear models.
Phase Diagram of Restricted Boltzmann Machines and Generalised Hopfield Networks with Arbitrary Priors
Barra, Adriano, Genovese, Giuseppe, Sollich, Peter, Tantari, Daniele
Restricted Boltzmann Machines are described by the Gibbs measure of a bipartite spin glass, which in turn corresponds to the one of a generalised Hopfield network. This equivalence allows us to characterise the state of these systems in terms of retrieval capabilities, both at low and high load. We study the paramagnetic-spin glass and the spin glass-retrieval phase transitions, as the pattern (i.e. weight) distribution and spin (i.e. unit) priors vary smoothly from Gaussian real variables to Boolean discrete variables. Our analysis shows that the presence of a retrieval phase is robust and not peculiar to the standard Hopfield model with Boolean patterns. The retrieval region is larger when the pattern entries and retrieval units get more peaked and, conversely, when the hidden units acquire a broader prior and therefore have a stronger response to high fields. Moreover, at low load retrieval always exists below some critical temperature, for every pattern distribution ranging from the Boolean to the Gaussian case.
Painting Analysis Using Wavelets and Probabilistic Topic Models
Wu, Tong, Polatkan, Gungor, Steel, David, Brown, William, Daubechies, Ingrid, Calderbank, Robert
PAINTING ANALYSIS USING WAVELETS AND PROBABILISTIC TOPIC MODELS Tong Wu, Gungor Polatkan, David Steel, William Brown, Ingrid Daubechies and Robert Calderbank ABSTRACT In this paper, computer-based techniques for stylistic analysis of paintings are applied to the five panels of the 14th century Peruzzi Altarpiece by Giotto di Bondone. Features are extracted by combining a dual-tree complex wavelet transform with a hidden Markov tree (HMT) model. Hierarchical clustering is used to identify stylistic keywords in image patches, and keyword frequencies are calculated for sub-images that each contains many patches. A generative hierarchical Bayesian model learns stylistic patterns of keywords; these patterns are then used to characterize the styles of the sub-images; this in turn, permits to discriminate between paintings. Results suggest that such unsupervised probabilistic topic models can be useful to distill characteristic elements of style. Index Terms -- Painting Analysis, Wavelet Transforms, Hidden Markov Trees, Topic Models, Machine Learning 1. INTRODUCTION In recent years wavelet methods have contributed to art history through their application to forgery detection [1], linking of underdrawing and overpainting [2], and uncovering elements of style [3, 4].